Text to Avatar in Multi-modal Human Computer Interface
Authors
Abstract
In this paper, we present a new text-driven avatar system consisting of three major components: a text-to-speech (TTS) unit, a speech-driven facial animation (SDFA) unit, and a text-to-sign-language (TTSL) unit. A new visual prosody time control model and an integrated learning framework are proposed to realize synchronization among speech synthesis, face animation, and gesture animation, which is crucial for this multi-modal synthesis system. Given meaningful sentences, the text-to-sign-language system, combined with the text-to-speech system, produces visual prosody information, including gesture animation parameters and timing information for the text-to-speech unit. The text-to-speech system then produces speech according to that timing information and a set of prosody rules. Finally, the speech directly drives MPEG-4-based face animation, with additional rules for facial expressions. This paper highlights synergies among the audio, visual, and gesture technology components. The performance of our system shows that the proposed algorithms are effective and greatly improve the realism of multi-modal speech synthesis.
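The pipeline described above — TTSL emitting gesture parameters and timing constraints, TTS synthesizing speech to match those timings, and the speech then driving MPEG-4-style face animation — can be sketched as follows. This is a minimal illustrative skeleton, not the authors' implementation: all function and field names (`text_to_sign`, `synthesize_speech`, `animate_face`, `duration_ms`) are hypothetical, and the per-word timing model is a stand-in for the paper's visual prosody time control model.

```python
from dataclasses import dataclass

@dataclass
class GestureSegment:
    """One sign-language gesture with its animation timing (hypothetical schema)."""
    word: str
    duration_ms: int  # timing constraint handed to the TTS unit

def text_to_sign(sentence: str) -> list[GestureSegment]:
    """TTSL unit (stub): map each word to a gesture with a nominal duration."""
    return [GestureSegment(word=w, duration_ms=400) for w in sentence.split()]

def synthesize_speech(sentence: str, timings_ms: list[int]) -> list[tuple[str, int]]:
    """TTS unit (stub): pair each word's synthesized audio with the gesture timing,
    so speech and gesture animation stay in lockstep."""
    return list(zip(sentence.split(), timings_ms))

def animate_face(speech: list[tuple[str, int]]) -> list[dict]:
    """SDFA unit (stub): derive MPEG-4-style animation frames from the speech stream."""
    frames, t = [], 0
    for word, dur in speech:
        frames.append({"t_ms": t, "viseme_for": word})
        t += dur
    return frames

def render(sentence: str):
    """Run the three units in sequence, sharing one timing source."""
    gestures = text_to_sign(sentence)                     # TTSL: gestures + timings
    speech = synthesize_speech(
        sentence, [g.duration_ms for g in gestures])      # TTS honors those timings
    faces = animate_face(speech)                          # speech drives the face
    return gestures, speech, faces
```

The key design point the sketch captures is that a single timing stream, produced by the gesture unit, is reused by both the speech and face-animation units, which is what keeps the three modalities synchronized.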
Similar Papers
Getting Closer – Tailored Multi-Modal Human-Computer Interaction
This paper outlines our vision of an advanced multi-modal call center using avatar technology, which adapts content, presentation, and interaction strategy to properties of the caller such as age, gender, and emotional state. User studies on Interactive Voice Response (IVR) systems have shown that these properties could be used effectively to “tailor” services to users who do not maintain perso...
Full Text
Architecture of a multi-modal dialogue system oriented to multilingual question-answering
In this paper, a proposal for a multi-modal dialogue system oriented to multilingual question-answering is presented. This system includes the following modes of access: voice, text, avatar, gestures, and sign language. The proposal is oriented to the question-answering task as a user interaction mechanism. The proposal presented here is in the first stages of its development phase, and the archite...
Full Text
Intelligent Virtual Agent: Creating a Multi-modal 3D Avatar Interface
Human-computer interactions can be greatly enhanced by the use of 3D avatars, representing both human users and computer systems in 3D virtual spaces. This allows the human user to interface with the computer system through natural and intuitive human-to-human dialog (face-to-face conversation), thereby continuing to blur the boundaries between the real and virtual worlds. This proposed avata...
Full Text
A Multi-Modal System Intellectual Computer AssistaNt
The paper describes a multi-modal system, ICANDO (Intellectual Computer AssistaNt for Disabled Operators), developed by the Speech Informatics Group of SPIIRAS and intended to assist people without hands, or with disabilities of their hands or arms, in human-computer interaction. This system combines modules for automatic speech recognition and head tracking in one multi-modal syste...
Full Text
Multi-modal Aided Presentation of Learning Information: a Usability Comparative Study
This paper presents a comparative two-group experimental study exploring whether the addition of multimodal interaction metaphors would enhance the usability of e-learning interfaces. Two independent groups of users took part in the experiment, each testing one of the two interface versions provided by the experimental e-learning tool. The first interface was based on a textual approach in...
Full Text